Sub-optimal tracking in switched systems with fixed final time and fixed mode sequence using reinforcement learning
نویسندگان
چکیده
Approximate dynamic programming is used to solve optimal tracking problems in switched systems with controlled subsystems and fixed mode sequence. Two feedback control solutions are generated such that the system tracks a desired reference signal, switching instants sought. Simulation results provided illustrate effectiveness of solutions.
منابع مشابه
Fixed-final-time optimal tracking control of input-affine nonlinear systems
In this study, approximate dynamics programming framework is utilized for solving the Bellman equation related to the fixed-final-time optimal tracking problem of input-affine nonlinear systems. Convergence of the weights of the neurocontroller in the proposed successive approximation based algorithms is provided and the network is trained to provide the optimal solution to the problems with a)...
متن کاملFixed vs Dynamic Sub-transfer in Reinforcement Learning Technical report
We survey various transfer methods in Q-learning, a type of reinforcement learning, and present a variation on fixed sub-transfer which we call dynamic sub-transfer. We describe the pros and cons of dynamic sub-transfer as compared with the other transfer methods, and we describe qualitatively the situations where this method would be preferred over the fixed version of sub-transfer.
متن کاملFixed vs. Dynamic Sub-Transfer in Reinforcement Learning
We survey various task transfer methods in Qlearning and present a variation on fixed sub-transfer which we call dynamic sub-transfer. We discuss the benefits and drawbacks of dynamic sub-transfer as compared with the other transfer methods, and we describe qualitatively the situations where this method would be preferred over the fixed version of sub-transfer. We test this method against sever...
متن کاملFixed-final-time optimal control of nonlinear systems with terminal constraints
A model-based reinforcement learning algorithm is developed in this paper for fixed-final-time optimal control of nonlinear systems with soft and hard terminal constraints. Convergence of the algorithm, for linear in the weights neural networks, is proved through a novel idea by showing that the training algorithm is a contraction mapping. Once trained, the developed neurocontroller is capable ...
متن کاملOptimal Sliding-Mode Guidance Law for Fixed-Interval Propulsive Maneuvers
An optimal strategy based on minimum effort control and also with terminal positionconstraint is developed for an exoatmospheric interceptor with a fixed- interval guidance time. It isthen integrated with sliding-mode control theory to drive an optimal sliding-mode guidance law forfixed-interval guidance time. In addition, this guidance law is generalized for intercepting anarbitrarily time-var...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Neurocomputing
سال: 2021
ISSN: ['0925-2312', '1872-8286']
DOI: https://doi.org/10.1016/j.neucom.2020.09.011